Article 5421

Title of the article

SPEECH/PAUSE SEGMENTATION METHOD BASED ON TEAGER ENERGY OPERATOR 

Authors

Alan K. Alimuradov, Candidate of technical sciences, director of student research and production business incubator, Penza State University (40 Krasnaya street, Penza, Russia), E-mail: alansapfir@yandex.ru 

UDC index

004.934 

DOI

10.21685/2227-8486-2021-4-5 

Abstract

Background. Segmenting speech into voiced sections, unvoiced sections, and pauses is a key task for most speech applications. It is especially important in systems that assess a person's psycho-emotional state from speech, since the durations of voiced sections, unvoiced sections, and pauses are informative parameters relevant to naturally expressed human emotions. Materials and methods. The second-order differential Teager energy operator was used, which is highly sensitive to changes in signal amplitude and frequency. The method was implemented in MATLAB (MathWorks). Results. A speech/pause segmentation method has been developed that linearly divides a speech signal into fragments, calculates the energy characteristic using the Teager energy operator, calculates the short-term energy values, and determines the "speech/pause" status of each fragment from calculated threshold values of the short-term energy. The method was studied to assess the effectiveness of its speech/pause segmentation relative to the classical method based on short-term energy analysis. Conclusions. According to the results obtained, the efficiency of speech/pause segmentation increases by 5.26 % and 5.51 % for errors of the first and second kind, respectively. The proposed speech/pause segmentation method can be effectively tested in systems for assessing human psycho-emotional state, owing to its high sensitivity to the sudden changes in signal amplitude and frequency that accompany unstable vocal motor skills.
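The steps named in the abstract (linear division into fragments, Teager energy operator, short-term energy, threshold decision) can be illustrated with a minimal Python sketch. It is not the author's MATLAB implementation. It uses the standard discrete form of the operator, psi[n] = x[n]^2 - x[n-1]*x[n+1]. The function names, the 160-sample fragment length, the use of the mean absolute operator output as the short-term energy, and the threshold rule (a fixed fraction of the peak fragment energy) are all assumptions made for illustration; the abstract does not state how the thresholds are calculated.

```python
import math

def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]**2 - x[n-1] * x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

def short_term_energy(fragment):
    """Mean absolute Teager energy of one fragment (0.0 if it is too short).

    ASSUMPTION: this definition of short-term energy is illustrative.
    """
    psi = teager_energy(fragment)
    return sum(abs(v) for v in psi) / len(psi) if psi else 0.0

def segment_speech_pause(signal, frame_len=160, threshold_ratio=0.1):
    """Return one flag per fragment: True for speech, False for pause."""
    # Linear division into consecutive, non-overlapping fragments.
    fragments = [signal[i:i + frame_len]
                 for i in range(0, len(signal) - frame_len + 1, frame_len)]
    energies = [short_term_energy(f) for f in fragments]
    # ASSUMPTION: the threshold is a fixed fraction of the peak fragment
    # energy; the article's own threshold calculation is not reproduced here.
    threshold = threshold_ratio * max(energies, default=0.0)
    return [e > threshold for e in energies]

# Synthetic check at 8 kHz: 0.1 s of silence, 0.1 s of a 200 Hz tone,
# then 0.1 s of silence.
fs = 8000
tone = [math.sin(2 * math.pi * 200 * n / fs) for n in range(800)]
signal = [0.0] * 800 + tone + [0.0] * 800
flags = segment_speech_pause(signal)
```

On this synthetic signal, the five fragments covering the tone are flagged as speech and the rest as pauses. Real recordings contain background noise, so a threshold derived from the noise floor would likely be needed in place of the fixed ratio used here.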

Key words

speech signal processing, speech segmentation, voiced and unvoiced speech, Short-Time Energy, Teager Energy Operator 

For citation

Alimuradov A.K. Speech/pause segmentation method based on Teager energy operator. Modeli, sistemy, seti v ekonomike, tekhnike, prirode i obshchestve = Models, systems, networks in economics, technology, nature and society. 2021;(4):52–63. (In Russ.). doi:10.21685/2227-8486-2021-4-5

 

Created: 25.03.2022 13:46
Updated: 06.04.2022 13:00